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Up to now, we have found many efficient algorithms for problems in Matchings, Flows, 
Linear Programs, and Convex Programming. All of these are polynomial-time algorithms. 

But there are also problems for which we have found no polynomial-time algorithms. The 
theory of NP-completeness unifies these failures. Roughly speaking, an NP-complete prob- 
lem is one that is as hard as any problem in a large class of problems. For example, the 
Traveling Salesman Problem (TSP), Integer Programming (IP), the Longest Cycle, and 
Satisfiability (SAT) are all hard problems. NP-completeness tells us that they are all, in a 
precise sense, equally hard. Let's look at each problem in a little more detail. 

1. The Traveling Salesman Problem 

Let's say that there exist a salesman that has to visit n cities and there exists a 
distance Wij between cities i and j. He wants to make sure to minimize his traveling 
time by visiting every city exactly once. In other words, there is a complete graph 
G = (V,E) with lengths Wij between nodes i and j. The question we must ask is: 
What is the shortest cycle that visits every node exactly once? 

2. Integer Linear Programming 

Suppose that you have a linear program such as the following: 

min c T x 
Ax < b for xi > 

This is your typical linear program. Now, if you decide to add an integrality constraint 
on Xi such that it is forced to be a positive integer, then you have an Integer Linear 
Program (ILP). 

3. Boolean Satisfiability 

The satisfiability problem (SAT) uses boolean expressions such as the following 

/ = (xi V x 2 V x 4 ) A (x 3 V x 4 ).... 



1 



with Xi = {True, False} and using well known boolean identities. Does F have a 
satisfying assignment? Can we find values of Xi such that every clause in F is equal 
1? 

4. Longest Cycles 

Given a graph G = (V, E), find the longest cycle. 

5. Cliques 

A clique is a complete subgraph. Given a graph G = (V, E), find a clique of maximum 
cardinality (vertices) . 

As different as these examples might seem, they have two main properties in common: 

A) None of them is known to have a polytime algorithm. 

B) If any one of them has a polytime algorithm, then they all do. 

1 Optimization vs Decision 

While property A seems trivial to us all by the inspection of each problem, property B is 
not as easy to see. To understand this property, we first formulate the decision versions of 
these optimization problems. 

Find the optimum among a set of feasible solutions F with cost function c 

vs 

Is there a feasible solution of cost < LI 

If the Optimal (OPT) is solved, then the Decision (DEC) is also solved. Namely, DEC 
reduces to OPT. Now, is OPT reducible to DEC? Well, using the TSP as an example, we 
ask: Is there a tour < LI Then, we proceed to do a binary search in order to find the 
length of the shortest tour, say S. But, we still don't know what the tour is. One way to 
figure this out is to use the following algorithm: 

Take out an edge e. 

Ask if the same graph still has a tour < S. 

If it does, then we don't need that edge and can delete it. 
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If it doesn't, then we keep that edge because it will be part of our tour. 
Repeat this algorithm for all the edges. 

In the ILP example, we can ask: Is there an x of cost < L? Well, one way to do this would 
be to set Xi = and if optimum stays the same, then we can fix that particular Xi to 0. 

For the maximum cliques problem, the OPT problem would be: Find the largest clique, 
while the DEC problem would be: Is there a clique of size < kl To find the optimal size 
k*, again we do a binary search. We then consider the graph with Vi and all its neighbors. 
If the optimum in this graph remains the same, then save that vertex, then we can delete 
all other vertices. Else, delete Vi because it is not in the max clique. 



2 P and NP 

2.1 Definitions 

P : Class of decision problems that can be solved in polytime. 

NP : Decision problems that have a short proof (certificate) for YES answers. The proof 
has length bounded by a polynomial in the size of the input, and its correctness can be 
verified in polytime. 

Note that problems in P have short proofs for both YES and NO answers. This means that 
P C NP. Let's look at a problem in P: 

Linear Programming: Is the minimum less than some c? 

YES: Give a feasible solution < c 

NO: Use the Dual of the problem to give a lower bound. 
Now, let's look at the following examples of NP problems: 

1. TSP, Is there a tour < LI 
YES: Give a tour 

NO: ? 

2. SAT, Does there exist a satisfying assignment? 
YES: Give a satifying assignment 
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NO: ? 

3. Min ILP, Is the minimum < c? 

YES: Give a feasible solution that is < c 
NO: ? 

This leads to the question: Is P = NP7 

2.2 Reductions 

A reduction from a problem A to a problem B is a function / : A — > B such that for all 
instances x 

X £ A4^ f{x) £ B. 

If the function / can be computed in polynomial time, then it is called a polynomial-time 
reduction. An implication of this is the following: 

If there exists a polytime algorithm for B, then there exists one for A. 

A problem B is NP-hard if every problem in NP has a polytime reduction to B. If, in 
addition, B is in NP, then it is NP-complete. 

Thus if A is NP-complete, and it has a reduction to another problem B in NP, then B is 
also NP-complete. 

2.3 Examples of Reduction 

SAT is NP-complete (we will not prove this in class). 

1. ILP is NP-complete Let's take the following SAT problem and see if it can be solved 
by an ILP. 

F = (xi V x 2 V ... V Xi) A (x 4 V x 5 ) A ... A (x a VxjV ... V x c ) 
This SAT problem can also be written in the following way 

X\ + X2 + ... + Xi > 1 

xa + £5 > 1 

X a + X b + ... + X c > 1 

4 



XI, 3 
■X3, 3 
>X5, 3 

Figure 1: clique is NP-complete 

i 1, then true 
Xi= \ 

y 0, then false 
Since SAT can be reduced to an ILP, ILP is NP-complete. 

2. Clique is NP-complete 

SAT can be reduced to clique by the following construction. Suppose we have a 
formula F with m clauses. 

1) Vertices are going to be of the form < x a ,i > where literal that occurs in 
clause Cj 

2) Edges are going to be of the form {< x a , i >, < Xf,,j >} for all x a / and i / j. 

By defining the vertices and edges this way, we ensure that all the connected vertices 
are compatible, since their truth values won't overlap. If we find a clique of size m in 
this graph, F is satisfiable. Refer to Fig. 1. 



XI + X.' + X4 ~ X5 
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